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Abstract —We present the construction of a new family of 
erasure correcting codes for distributed storage that yield low 
repair bandwidth and low repair complexity. The construction 
is based on two classes of parity symbols. The primary goal 
of the first class of symbols is to provide good fault tolerance, 
while the second class facilitates node repair, reducing the repair 
bandwidth and the repair complexity. We compare the proposed 
codes with other codes proposed in the literature. 

I. Introduction 

Distributed storage (DS) uses a network of interconnected 
inexpensive storage devices (referred to as storage nodes or 
simply nodes) to store data reliably over long periods of time. 
Reliability against node failures (commonly referred to as fault 
tolerance ) is achieved by means of erasure correcting coding. 
Furthermore, when a node fails, a new node needs to be added 
to the DS network and populated with data to maintain the 
initial state of reliability. The problem of repairing a failed 
node is known as the repair problem. 

Classical maximum distance separable (MDS) codes are op¬ 
timal in terms of the fault tolerance/storage overhead tradeoff. 
However, the repair of a failed node requires the retrieval of 
large amounts of data from a large subset of nodes. Therefore, 
in the recent years, the design of erasure correcting codes that 
reduce the cost of repair has attracted a great deal of attention. 
Pyramid codes m were one of the first code constructions that 
addressed this problem. In particular, they aim at reducing the 
number of nodes that need to be contacted to repair a failed 
node, known as the repair access. Other codes that reduce the 
repair access are the local reconstruction codes (LRCs) 0, 
and the locally repairable codes a, a. 

Other code constructions aim at reducing the repair band¬ 
width, defined as the amount of information that needs 
to be read from the DS network to repair a failed node. 
Among them, we can mention minimum disk I/O repairable 
(MDR) codes 0, Zigzag codes 0 and piggyback codes 
0. Piggybacking consists of adding carefully chosen linear 
combinations of data symbols (called piggybacks) to the parity 
symbols of a given erasure correcting code. This results in 
a lower repair bandwidth at the expense of a lower erasure 
correcting capability with respect to the original code. 

In this paper, we propose a family of erasure correcting 
codes that achieve low repair bandwidth and low repair com¬ 
plexity. In particular, we propose a systematic code construe- 
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Fig. 1: System model. 


tion based on two classes of parity symbols. Correspondingly, 
there are two classes of parity nodes. The first class of parity 
nodes, whose primary goal is to provide erasure correcting 
capability, is constructed using an MDS code modified by 
applying specially designed piggybacks to some of its code 
symbols. The second class of parity nodes is constructed using 
a block code whose parity symbols are obtained with simple 
additions. This class of parity nodes does not have the purpose 
to bring any additional erasure correcting capability, but to 
facilitate node repair at low repair bandwidth and low repair 
complexity. In the paper, we compare the proposed codes with 
MDR codes. Zigzag codes, piggyback codes and LRCs (2, in 
terms of repair bandwidth and repair complexity. 

Notation: We define the operator ( a+b)k — (a+b ) mod k. 
The Galois field of order q p is denoted by F gP . 

II. Code Construction 

We consider the distributed storage system depicted in 
Fig. Q] There are k data nodes, each containing a very large 
number of data symbols over F gP . As we shall see in the 
sequel, the proposed code construction works with blocks of 
k data symbols per node. Thus, without loss of generality, we 
assume that each node contains k data symbols. We denote 
by dij, i,j = 0,..., k — 1, the ith data symbol in the jth 
data node. We say that the data symbols form a k x k data 
array D, where dj ,j = \D], :r For later use, we also define 
the set of data symbols V = {d, :l }. Further, there are n — k 
parity nodes each storing k parity symbols. We denote by Pij, 
i = 0 ,...,k — 1, j = k,... ,n — 1, the ith parity symbol in 
the jth parity node, and define the set V 3 as the set of parity 
symbols in the jth parity node. The set of all parity symbols 
is denoted by V = U, {Vj}. We say that the data and parity 
symbols form a k x n code array C, where Cjj = [C],j. 
Note that c t j = dij for i,j = 0,..., k — 1 and q j = p 3 j for 

























































































































i = 0,..., k — 1, j = k,..., n — 1. 

Our main goal is to construct codes that yield low re¬ 
pair bandwidth and low repair complexity of a single failed 
systematic node. To this purpose, we construct a family of 
systematic (n, k) codes consisting of two different classes 
of parity symbols. Correspondingly, there are two classes of 
parity nodes, referred to as Class A and Class B parity nodes, 
as shown in Fig. Q] Class A and Class B parity nodes are built 
using an (tia,A;) code and an (riB,k) code, respectively, such 
that n = riA-T «b — k. In other words, the parity nodes of the 
(n, k) codqj correspond to the parity nodes of Class A and 
Class B codes. The primary goal of Class A parity nodes is to 
achieve good erasure correcting capability, while the purpose 
of Class B nodes is to yield low repair bandwidth and low 
repair complexity. In particular, we focus on the repair of data 
nodes. The repair bandwidth (in bits) per node, denoted by 7 , 
is proportional to the average number of symbols (data and 
parity) that need to be read to repair a data symbol, denoted 
by A. More precisely, let (3 be the number of symbols per 
no dfl Then, 


where v = |~log 2 q p ~\ is the size (in bits) of a symbol. A can 
be interpreted as the repair bandwidth normalized by the size 
(in bits) of a node. Therefore, in the rest of the paper we will 
use A to refer to the normalized repair bandwidth. 

The main principle behind our code construction is the 
following. The repair is performed one symbol at a time. After 
the repair of a data symbol is accomplished, the symbols read 
to repair that symbol are cached in the memory. Therefore, 
they can be used to repair the remaining data symbols at no 
additional read cost. The proposed codes are constructed in 
such a way that the repair of a new data symbol requires a 
low additional read cost (defined as the number of additional 
symbols that need to be read to repair the data symbol), so that 
A (hence 7 ) is reduced. Since we will often use the concepts 
of read cost and additional read cost in the remainder of the 
paper, we define them in the following. 

Definition 1: The read cost of a symbol is the number 
of symbols that need to be read to repair the symbol. The 
additional read cost of a symbol is the additional number of 
symbols that need to be read to repair the symbol, considering 
that other symbols are already cached in the memory (i.e., have 
been read to recover some other data symbols previously). 

III. Class A Parity Nodes 

Class A parity nodes are constructed using a modified 
(tia, k) MDS code, k + 2 < n/\ < 2k, over F gP . In particular, 
we start from an (tza, k) MDS code and apply piggybacks Q 
to some of the parity symbols. The construction of Class A 
parity nodes is performed in two steps as follows. 

'With some abuse of language we refer to the nodes storing the parity 
symbols of a code as the parity nodes of the code. 

2 For our code construction, (3 = k, but this is not the case in general. 
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Fig. 2: A (7, 5) Class A code with r = 1 constructed from a (7, 5) MDS 
code. Vf and Vf are the parity nodes. For each row j, colored symbols 
belong to 7 Zj. 

1) Encode each row of the data array using an (?za, /<’) MDS 
code (the same for each row). The parity symbol pX if] 

fc-i 

Pij = ^2 a kj d iX j = k,... ,n A - 1, ( 2 ) 

1=0 

where aij denotes a coefficient in ¥ qV . Store the parity 
symbols in the corresponding row of the code array. 
Overall, k{np, — k) parity symbols are generated. 

2) Modify some of the parity symbols by adding piggy¬ 
backs. Let r, 1 < r < 7 Za — k — 1, be the number of 
piggybacks introduced per row. The parity symbol p t , u 
is obtained as 

Pi,u = Pi,u "A d(i-\-u— n A +T+l)fc,i) (3) 

where u = n/\ — t, ... ,n/\ — 1 and the second term in 
the summation is the piggyback. 

The fault tolerance (i.e., the number of node failures that 
can be tolerated) of Class A codes is given in the following 
theorem. 

Theorem 1: An (zza, k) Class A code with r piggybacks per 
row can correct a minimum of ?za — k — r + 1 node failures. 

Proof: The proof is given in the appendix. ■ 

When a failure of a data node occurs. Class A parity nodes 
are used to repair r + 1 of the k failed symbols. The Class A 
parity symbols are constructed in such a way that, when node 
j is erased, r + 1 data symbols in this node can be repaired 
reading the (non-failed) k — 1 data symbols in the jth row of 
the data array and r+1 parity symbols in the jth row of Class 
A nodes (see also Section II V-Cb . For later use, we define the 
set IZj as follows. 

Definition 2: For j = 0,..., k — 1, define the set IZj as 

'R'j — { d j,(i+Dk i d j,(j + 2 )fc i''') d j,(j+k—i)k }■ 

The set IZj is the set of k — 1 data symbols that are read 
from row j to recover r + 1 data symbols of node j using 
Class A parity nodes. 

Example 1: An example of Class A code is shown in Fig. [2] 
One can verify that the code can correct any 2 node failures. 


3 We use the superscript A to indicate that the parity symbol is stored in a 
Class A parity node. 




















For each row j, the set Rj is indicated in red color. For 
instance, TZq = {cZo,i■> do, 2 , cfo, 3 , ^ 0 , 4 }- 

The main purpose of Class A parity nodes is to provide good 
erasure correcting capability. However, the use of piggybacks 
helps also in reducing the number of symbols that need to 
be read to repair the r + 1 symbols of a failed node that are 
repaired using Class A code, as compared to MDS codes. The 
remaining k — r — 1 data symbols of the failed node can also 
be recovered from Class A parity nodes, but at a high symbol 
read cost. Hence, the idea is to add another class of parity 
nodes, namely Class B parity nodes, in such a way that these 
symbols can be recovered with lower read cost. 

IV. Class B Parity Nodes 

Class B parity nodes are obtained using an (ns,k) linear 
block code over F gP to encode the k x k data symbols of the 
data array, i.e., we use the (ns, k) code k times. This generates 
(ns — k) x k Class B parity symbols, p B u , i = 0,..., k — 1, 
u = riA,..., n — 1. 

Definition 3: For j = 0,1, define the set Qj as 

Qj ~ {^(j+T+l)fc,J) < ^0'+T+2)fc, i J ) ' ' ' 1 d(j+k— l)fe }• (4) 

Assume that data node j fails. It is easy to see that the set 
Qj is the set of k — t 1 data symbols that are not recovered 
using Class A parity nodes. 

Example 2: For the example in Fig. [2] the set Qj is indi¬ 
cated by hatched symbols for each column j, j = 0, ..., k— 1 . 
For instance, Q 0 = {d 2 ,o, d 3 ,o, cZ 4 ,o}- 

For later use, we also define the following set. 

Definition 4: For j = 0. 1, define the set Xj as 

^3 = {4n(j+i)k 1 ^i,(i+2)fe >''' i dj,(j-\-k—T- i) fc }■ (5) 

Note that Xj = IZj Cl {U;Qi}. 

Example 3: For the example in Fig. |2] the set Xj is in¬ 
dicated by hatched symbols for each row i. For instance, 
Xq = TZq fl {Qo U Qi U Q 2 U Q 3 U Q 4 } = {c?o,i, do, 2 , do, 3 }. 

The purpose of Class B parity nodes is to allow recovering 
of the data symbols in Qj, j = 0,..., fc — 1, at a low additional 
read cost. Note that after recovering r + 1 symbols using Class 
A parity nodes, the data symbols in 7 Zj are already stored 
in the decoder memory, therefore they are accessible for the 
recovery of the remaining k — r — 1 data symbols using Class 
B parity nodes without the need of reading them again. The 
main idea is based on the following proposition. 

Proposition 1: If a Class B parity symbol p B is the sum of 
one data symbol d G Qj and a number of data symbols in 
Xj, then the recovery of d comes at the cost of one additional 
read (one should read parity symbol p B ). 

This observation is used in the construction of Class B parity 
nodes (see Section IIV-AI below) to reduce the normalized 
repair bandwidth, A. In particular, we add k — r — 1 Class B 
parity nodes which allow to reduce the additional read cost of 
all k(k — t — 1) data symbols in all Q,’s to 1. (The addition 
of a single Class B parity node allows to recover one new 
data symbol in each Qj, j = 0,..., k — 1, at the cost of one 
additional read). 


In order to describe the code construction, we define the 
function read(d, p B ) as follows. 

Definition 5: Consider a Class B parity node and let V B 
denote the set of parity symbols in this node. Also, let d G Qj 
for some j and p B G V B be p B = d + ^where 

V C V, i.e., the parity symbol p B is the sum of d and a 
subset of other data symbols. Then, 

read(d,p B ) = \T>\Xj\, ( 6 ) 

where V = {V U d}. 

For a given data symbol d, the function read(d,p B ) gives 
the additional number of symbols that need to be read to 
recover d (considering the fact that some symbols are already 
cached in the memory). 

A. Construction Example 

In the following, we propose a recursive algorithm for the 
construction of Class B parity nodes. To ease understanding, 
we introduce the algorithm through an example. 

We construct a (10, 5) code starting from the (7, 5) Class A 
code in Fig. [2] In particular, we construct k —t — 1 = 3 Class B 
parity nodes, so that the additional number of reads to repair 
each of the remaining failed k — t — 1 = 3 symbols (after 
recovering r + 1 = 2 symbols using Class A parity nodes) is 
1. With some abuse of notation, we denote these parity nodes 
b y ^ 7 , Pf, and Tf. 

Denote by A, ajj = [A] l:1 , a temporary matrix of read 
values for the respective data symbols djj. After Class A 
decoding, 

{ 00 if dij G {U t Q t } 
k if i = j (7) 

1 otherwise, 

where t = 0,..., k — 1. For our example, A after Class A 
decoding is given in Fig. [3ja). Our algorithm operates on the 
dij s whose initial value is 00 and aims to obtain the lowest 
possible values for these a,;.jS under the given number of Class 
B parity nodes. This is done in a recursive manner as follows. 
1. Construct the first parity node, V B . 
la For each symbol dj : j define the set 'D, :l = 

{d(i+s) k ,(j+s) k } s =o • 

lb Start with the elements in Qo- Pick an element dip G Qo 
such that a,; o = 00 , and doj G Ao\2\o- For instance, 
we take d 2 ,o- 

lc For t = 0,..., k — 1 compute 

Pt,7 ~ d(i+t) k ,t + d t ^ i+t j k (8) 

and update the respective a h Q and ao,i, 

a (i+t) k ,t a t,(i+t) k iead(d(j_|-t) fc ,t, , 7 ) • (^) 

The resulting matrix A is shown in Fig. [T]b). There are 
still entries = 00 that need to be handled. 

Id For t = 0, — 1 update 

Pt,7 ~ Pt,7 + ^t,(i’+t) k i 


( 10 ) 
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(a) Initial. (b) Step lc. (c) Step Id. (d) Step 2b. (e) Step 3b. 

Fig. 3: Update of A during the construction of Class B parity nodes for the example in Section II V-A I The updates of a% j after each step are highlighted in 
red color. The shaded symbols in column j denote the set Qj , while the shaded symbols in row i denote the set X{. 
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Fig. 4: Class B parity nodes for the data nodes in Fig. [2] 

where do,*' £ X-o and ao,*< = oo after step lb. Update 
A accordingly (see Fig. [3jc)). Note that the read values 
a (i+t) k ,{j+t) k have not worsened. This comes from the 
fact that the new added data symbol belongs to the cor¬ 
responding set X and is already cached in the memory. 
Thus, the additional read cost is 0. On the other hand, 
the values a {j+t)k ^ l+t)k increase. 

2. Construct the second parity node, Vi. 

2a Pick an element d,o £ Qo such that the corresponding 
dij is maximal. In our example, this is <£ 4,0 because 
04,0 = 3. 

2b For t = 0,. .., k — 1, do the following. Pick an element 
dt,(u+t) k £ Xt\D. L j such that for all dp _j> £ V, 
read (dpj>,ptp) < where p B is set to pf 8 = 

d(i+t) k ,t + d t ,( u +t) k - For our example, we choose d 0 p. 
Note that the only other option, dgp, is not a good choice 
as the new additional read cost would increase from 1 to 
2. If such d t ^ u+t)k does not exist, set pf 8 = d( i+t)fcit . 
Update A. The updated matrix is shown in Fig. QT} d). 

3. Construct Vg. 

3a Pick an element dip £ Qo such that the corresponding 
dip is maximal. In our example, this is < 3 ( 3 , 0 - 

3b For t = 0,..., k — 1, do the following, pf 9 = d(*+t) fc ,t- 
Update A. The resulting A has value k for all diagonal 
elements and 1 elsewhere (Fig. 0e)). 

The Class B parity nodes V B , V 8 , and Vg are shown in 

Fig. ID 

A general version of the algorithm to construct Class B 
parity nodes is given in Appendix [B] 


B. Discussion of the Construction Example 

The construction of Class B parity nodes starts by selecting 
an element dij of a given Qj such that a t ,j = 00 and 
djj £ Xj\Dij (for simplicity, as in the example, we can 
start with j = 0). The first parity symbol of f7 after step 
lc is therefore po ,7 = dip + do,*, and the remaining parity 
symbols are obtained as in ©. By PropositionQ]the additional 
read cost of dij (after step lc) is 1. The reason for selecting 
dj y i £ Xj\T>ij is due to the fact that, again by Proposition Q] 
its additional read cost is also 1. We remark that for each 
dij £ Qj it is not always possible to select djp £ X 3 \T> t j 
and set pj t 7 = dij + djp. This is the case when k < 2(r + 1). 
If djj £ Xj\T>ij does not exist, then we select djp £ Xj\T> t j 
(see Appendix IbIi. In this case, the additional read cost of djp 
(after step lc) is > 1 . 

In general, step Id has to be performed Q ;i |—2 times, 
corresponding to the number of entries a^j = 00 per column 
of A. 

Adding k — r — 1 Class B nodes allows to reduce the 
additional read cost for all data symbols in all Qj to 1 (see 
Fig. |3je)). However, this comes at the expense of a reduction 
in the code rate, i.e., the storage overhead is increased. In 
the example, k — r — 1 = 3 Class B parity nodes need to 
be introduced, which reduces the code rate from R = 5/7 to 
R = 5/10 = 1/2. If a lower storage overhead is required. 
Class B parity nodes can be punctured , starting from the 
last parity node (for the example, nodes Vg, Vf, and V B 
are punctured in this order), at the expense of an increased 
repair bandwidth. If all Class B parity nodes are punctured, 
we would remain only with Class A parity nodes and the 
repair bandwidth corresponds to that of the Class A code. 
Thus, our code construction gives a family of rate-compatible 
codes which trades off between repair bandwidth and storage 
overhead: adding more Class B nodes reduces the repair 
bandwidth but increases the storage overhead. 

C. Repair of a Single Node Failure: Decoding Schedule 

The repair of a failed systematic node, proceeds as follows. 

First, r + 1 symbols are repaired using Class A parity nodes. 
Then, the remaining symbols are repaired using Class B parity 
nodes. With a slight abuse of language, we will refer to the 
repair of symbols using Class A and Class B parity nodes 
as the decoding of Class A and Class B codes, respectively. 
Suppose that node j fails. Decoding is as follows. 

• Decoding of Class A code. To reconstruct the failed data 
symbol in the yth row of the code array, k symbols (k — 1 






















































data symbols and p^ k ) in the yth row are read. These 
symbols are now cached in the memory. We then read the 
r piggybacked symbols in the yth row. By construction 
(see ©), this allows to repair r failed symbols, at the 
cost of an additional read each. 

. Decoding of Class B code. Each remaining failed data 
symbol d l:3 £ Qj is obtained by reading a Class B parity 
symbol whose corresponding set V (see Definition 0 
contains dij. In particular, if several Class B parity 
symbols pf, contain dij, we read the parity symbol 
with largest index j'. This yields the lowest additional 
read cost. 

V. Code Characteristics and Comparison 

In this section we characterize some different properties of 
the codes presented in Sections [III] and HV1 

A. Fault Tolerance 

The fault tolerance of the Class A code depends on the 
MDS code used in its construction and r, as stated in Theorem 
H] Hence, our proposed code has also fault tolerance / > 
n/\ — k — t + 1. Since 1 < t < n/\ — k — 1, our codes have a 
fault tolerance of at least 2 . 

B. Normalized Repair Bandwidth 

According to Section I1V-CI to repair the first r +1 symbols 
in a failed node requires that k — 1 data symbols plus r + 1 
Class A parity symbols are read. The remaining k — T — 1 data 
symbols in the failed node are repaired by reading the Class B 
parity symbols. As seen in Section [IV] the parity symbols in 
the first Class B parity node are constructed from sets of data 
symbols of cardinality \Qj\= k — t — 1. Therefore, to repair 
each of the k — r — 1 data symbols in this set requires to 
read at most k — r — 1 symbols. The remaining Class B parity 
nodes are constructed from fewer symbols than k — T—1. An 
upper bound on the normalized repair bandwidth is therefore 
A < (k + t + (k — t — 1 ) 2 )/k. It is observed that when r 
increases, the fault tolerance reduces while A improves. 

C. Repair Complexity of a Failed Node 

We first consider the complexity of elementary arithmetic 
operations of elements of size v = [log 2 q p ~\ in ¥ qP . An 
addition requires O(v) and multiplication requires 0(ir). The 
term inside Of) denotes the number of elementary binary 
additions. To repair the first symbol requires k multiplications 
and k — 1 additions. To repair the following r symbols 
require an additional rfc multiplications and additions. The 
final k — T — 1 symbols require at most k — r — 2 additions, 
since Class B parity symbols are constructed as the sum of at 
most k — T — 1 data symbols. The repair complexity of one 
failed node is therefore 

Cr = 0 ((k — l)v + ku 2 ) + 0(rk(y + v 2 )) + 0 ((k — r — 2 ) 2 i'). 

( 11 ) 

The first two terms correspond to the Class A code while the 
last term corresponds to the Class B code. 


D. Encoding Complexity 

The encoding complexity of the in. k ) code, Ce, is the sum 
of the encoding complexities of the two codes. The generation 
of each of the n/\ — k Class A parity symbols in one row 
of the code array, p3,. in 0 , requires k multiplications and 
k — 1 additions. Adding data symbols to r of these parity 
symbols according to (0 requires an additional r additions. 
The encoding complexity of the Class A code is therefore 

Ca = 0((nA - k)[kv 2 + (k - 1 )^)) + Ofrv). ( 12 ) 

According to Section |IV] the parity symbols in the first 
Class B parity node are constructed as the sum of k — r — 1 
data symbols, and each parity symbol in the subsequent parity 
nodes uses one less data symbol. Therefore, the encoding 
complexity of the Class B code is 

n—n/\ 

C B = 0((h-T-l-i)v). (13) 

i=l 

Finally, Ce = Ca + Cb- 

E. Code Comparison 

Table U provides a summary of the characteristics of dif¬ 
ferent codes proposed in the literature as well as the codes 
constructed in this paperQ In the table, column 2 reports the 
value of 3 (see 0) for each code construction. For our code, 
3 = k, unlike for MDR and Zigzag codes, for which /3 
grows exponentially with k. This implies that our codes require 
less memory to cache data symbols during repair. The fault 
tolerance /, the normalized repair bandwidth A, the normalized 
repair complexity, and the encoding complexity, discussed in 
the previous subsections, are reported in columns 3, 4, 5, and 
6 , respectively. 

In Fig. 0 we compare our codes with other codes in the 
literature. In particular, the figure plots the normalized repair 
complexity of (n, k, /) codes over F 2 s (v = 8) versus their 
normalized repair bandwidth A. In contrast to the bounds for 
the repair bandwidth and complexity reported in TableQ] Fig.0 
contains the exact number of integer additions. 

The best codes for a DS system should be the ones that 
achieve the lowest repair bandwidth and have the lowest repair 
complexity. As seen in Fig. 0 MDS codes have both high 
repair complexity and repair bandwidth, but they are optimal 
in terms of fault tolerance for a given n and k. Zigzag codes 
achieve the same fault tolerance and high repair complexity 
as MDS codes, but at the lowest repair bandwidth. At the 
other end, FRCs yield the lowest repair complexity but a 
higher repair bandwidth and worse fault tolerance than Zigzag 
codes. Piggyback codes have a repair bandwidth between 
that of Zigzag and MDS codes, but with a higher repair 
complexity and worse fault tolerance. For a given storage 
overhead, our proposed codes have better repair bandwidth 
than MDS codes, Piggyback codes and LRCs, and equal or 
similar repair bandwidth than Zigzag codes. Furthermore, they 

4 The variables t,t r and r in Tableware defined in (7) and [2] respectively. 
The definition of t comes directly from r that is defined in 0. 




TABLE I: Comparison of (n, k) codes that aim at reducing repair bandwidth. The repair bandwidth and the repair complexity are normalized per symbol, 
while the encoding complexity is given per row in the code array. Note that for MDR codes n = k + 2. 
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Fig. 5: Comparisons of different codes (n, k. f) with v = 8. 


yield lower repair complexity as compared to MDS, Piggyback 
and Zigzag codes. However, the benefits in terms of repair 
bandwidth and/or repair complexity with respect to MDS and 
Zigzag codes come at a price of a lower fault tolerance. 

VI. Conclusion 

In this paper, we constructed a new class of codes that 
achieve low repair bandwidth and low repair complexity for 
a single node failure. The codes are constructed from two 
smaller codes. Class A and B, where the former focuses on 
the fault tolerance of the code, and the latter focuses on 
reducing the repair bandwidth and complexity. Our proposed 
codes achieve better repair complexity than Zigzag codes and 
Piggyback codes and better repair bandwidth than LRCs, but at 
the cost of slightly lower fault tolerance. A side effect of such a 
construction is that the number of symbols per node that needs 
to be encoded grows linearly with the code dimension. This 
implies that our codes are suitable for memory constrained DS 
systems as compared to Zigzag and MDR codes, for which the 
number of symbols per node increases exponentially with the 
code dimension. 


Appendix A 
Proof of TheoremD] 

Each row in the code array contains n/\ — k — t parity 
symbols based on the MDS construction (i.e., parity symbols 


without piggybacks). Using these symbols, one can recover 
n/\—k—T data symbols in that row and, thus, n^—k—r failures 
of systematic nodes. In order to prove the theorem, we need 
to show that by using piggybacked parity symbols pi tU , i = 
0 ,..., k— 1 , in some parity node, u, it is possible to correct one 
arbitrary systematic node failure. To do this, let us consider 
the system of linear equations Gd J = p T , representing the 
set of parity equations to compute pi tU s where u = — r. In 

other words, d = (d 0 ,o, • • ■, do.fc- 1 , di,o, ■ ■ ■, <4-i,k-i), P = 
(po,u, ■ ■ ■ ,Pk-i,u), and G is given by 
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where a = (ao,u> • ■ • > a fc-i,u)> u i is a vector of length k 
with one at position i and zeros elsewhere, and 0 is the all¬ 
zero vector of size k. Now, assume a systematic node r has 
failed. In order to repair it, we need to solve the following 
subsystem of linear equations G'w J = p T , in which w = 
(do,r> ■ ■ •, dk-i, r ) and G 1 is a k x k submatrix of G such 
that: a) its diagonal elements are all a r y, b) it has 1 at row 
r and column (r + l) k ’, c) all other entries are 0. Note that 
G' is full rank. Therefore, one arbitrary data symbol can be 
corrected and, hence, the erasure correcting capability of Class 
A code is at least n/\ — k — t + 1, which completes the proof. 

Appendix B 

Algorithm to Construct Class B Parity Nodes 

We give an algorithm to construct k — r — 1 Class B 
parity nodes in the order V^,P^ +1 ,... ,V^ +k _ T _ 2 . This 
results in the construction of (k — r — 1 )k parity symbols 
pfj. The algorithm is given in Algorithm |T] Consider the 
construction of the parity symbols of parity node V nA . The 
algorithm constructs first the parity symbol p® „ A as the sum of 
an element d^o G Qo and max_itr elements in Xq. Then, the 
other parity symbols pf n , t > 0, are constructed as the sum 
of an element d( i+t \ k , t £ Qt and max_itr elements in X t , i.e., 
following a specific pattern. The remaining parity nodes are 
constructed in a similar way, with the only difference that the 
number of elements added from the sets X t , max_itr, varies 
for each parity node. The construction of the parity symbols 
pfj depends on the choice of the symbols in the sets Q t and 

















Algorithm 1: Construction of Class B parity nodes 

Initialization: 


Vi,j = 0,... ,k - 1 

ai,j as defined in <[7| 

= {^(j+s)fc,(i+s)fe }s=0 
rnax_itr = k — r — 2 

l for u; 77 -A to nA + k — r — 2 do 

ill construct /c — r — 1 nodes 

3 choose di,o G Qo s.t. a*,o is max && do,i £ ^o\A,o 

4 if d 0 , « 0 ^b\^,o then choose d i?0 £ Qo s.t. a*,o is 
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for t 4 - 1 to /c — 1 do 


Pt,u 


= d, 


(i+t)k,t 


end 

for •<— 1 to max_itr do 
temp = + d 0 ,i 

if Ur = 1 && do.,; € Ao\2?po && 
read(do,i,temp) < ao,t then 
i' t— i 
Po,a; = 

Oo,i' = «i ',0 = read(d 0 ,p,Po,J = 1 
for t <r- 1 to k — 1 do 
Ptt = Pit + d t ,(i' +t ) k 
a t,(i'+t) k = r ead(di,(i'+i) fc ,Pi B J 

«(i+i) fc ,t = read(d (i+t)(sj t,p t B w ) 

end 
else 

if 3doy £ Xo\Vi Q && 

read(d 0 ,i',Po w ) - a oy && a o,i' > 1 then 
Po,u> = Po,aj + do,i' 
ao ,i' = max_itr + 1 
«o,i = read(d 0 i i,pg jClJ ) 

for t <— 1 to k — 1 do 

Pit = Pit + rf i,(i'+t)io 
a t ,(i'+t) fc = maxjtr + 1 

a t,(z+t) fe =read(d t)(i+t)fc ,p t B w ) 

end 

end 

end 

end 

max itr <— max itr — 1 


additional read cost does not increase (line 20 in the 
algorithm). If such a condition is not met, then the symbol 
do,i' is n °t used in the construction of the parity symbol. 

After the construction of each parity symbol, the corre¬ 
sponding entry of matrix A is updated. 
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X t . Assume that a parity symbol ; is constructed. The data 
symbols involved in p B ; are picked as follows. 

• Choice of a data symbol in Qq: Select a symbol dpo G Qo 
such that the corresponding apo is maximum and there 
exists do,i £ A’o'yPpo (lines 2 and 3 in the algorithm). If 
the latter does not exists, then select dpo such that apo 
is maximum. Such a dpo always exist. 

• Choice of max_itr data symbols in Xq: Select max_itr 
symbols do,p £ Xq\"D,; q such that ao ,i’ > 1 and its 












